DOCUMENT RESUME 



ED 357 056 



TM 019 776 



AUTHOR Gillespie, Maggie; Noblo, JuHd 

TITLE Factors Affecting Student Persistence: A Longitudinal 

Study. 

INSTITUflON American Coll. Testing Program, lovia City, Iowa. 
REPORT NO ACT-RR-92-4 
PUB DATE Nov 92 

NOTE 47p. 

AVAILABLE FROM ACT Research Report Series, P.O, Box 168, Iowa City, 
I A 52243. 

Reports - Research/Technical (143) 



PUB TYPE 

EDRS PRICE 
DESCRIPTORS 



IDENTIFIERS 



MF01/PC02 Plus Postage. 

^'Academic Persistence; College Attendance; '^College 
Freshmen; Data Collection; Dropout Research; Higher 
Education; High Risk Students; ^Institutional 
Characteristics; Longitudinal Studies; Potential 
Dropouts; ^Predictor Variables; Regression 
(Statistics); School Holding Power; ""jtudent 
Attrition; Student Characteristic;:; Withdrawal 
(Education) 
*Tinto Model 



ABSTRACT 
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identifying high-risk students were calculated from the probabilities 
generated by logistic regression. Results support Tinto's view that 
persistence models are specific to individual institutions and to the 
time period being examined. An example is provided for using the 
results to identify high-risk students and to develop intervention 
strategies targeting key factors related to student retention. A 
discussion of the practical issues involved in collecting retention 
data is also provided. Fourt^een tables present study findings. 
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FACTORS AFFECTING STUDENT PERSISTENCE; 
A LONGITUDINAL STUDY 



Maggie Gillespie 
Julie Noble 



ABSTRACT 



The purpose of this study was to identify student and institutional characteristics related to college 
freshman persistence. Persistence was exan\ini2d for five institutions at four paints in time: end of first 
term, re-enrollment in the spring term, end of sjjring term, and re-enrollment in the fall of the sophomore 
year. Data from a variety of sources wore used; predictor variables were selected to represent the 
components of Tinto's model of persistence. Both linear and logistic regression were used to develop 
separate prediction models for each institution. Estimated success rates and accuracy rates of the models 
for identifying high-risk students were calculated from the probabilities generated by logistic regression. 

The results supported Tinto's view tliat persistence models are specific to indi\'dual institutions 
and to the time period being examined. An exjimple is provided for using such results for identifying 
high-risk students and for developii 'ntervention Sitrategies targeting key factors related to student 
retention. A discL-.ssion of the practical issues involved in collecting reteution data is also provided. 



FACTORS AFFECTING STUDENT PERSISTENCE: 
A LONGITUDINAL STUDY 



Student retention is a major and on-going concern at postsecondary tnstiiutions. With restrictions 
on financial resoi"»rces and decreases in the traditional -ago college freshman pool, institutions are striving 
to find ways to identify and retain potential H.opouts. An effective identification process would enable 
an iniititution ti> assist potential dropouts through advising, counseling, or other interventions. Tliis could 
rcfault in a more productive and satisfying ed jcational experience for students, and an improved retention 
rat'; for the institution. 

Earlier Research 

Numerous studies have been conducted to examine student retention; several propose theoretical 
models to explain student retention (e.g., Spady, 1971; Pascarella, 1980; Bean, 1986). The model developed 
by Tinto (1975) is probably the one moit widely tested (e.g., Terenzini & Pascarella, 1977; Munro, 1981; 
Pascarella & Terenzini, 1983; Noble, 1988; Halpin, 1990). Tinto's model emphasizes integration and 
a)mmitmenl: students' background characteristics (family background, individual attributes, and 
precollegc schooling) interact and influence students' initial commitment to the goal of college completion 
(goal commitment) and initial commitment to the institution (institutional commitment). These 
commitments in turn influence students' intellectual development and academic performance, which 
determine academic integration. Goal and institutional commitment also influence peer group and faculty 
interactions, which determine one aspect of social integration. Increased academic integration and social 
integration lead to greater goal commitment and institutional commitment, which reduce the probability 
of the student dropping out. 

Though many studies have examined student persistence, they have often nut included one or 
mure important factors: a focus on the relationship between student characteristics and institutional 
environment, an adequate definitinn af "dropout," a theoretical framework to explain the attrition process, 
appropriate multivariate statistical techniques, and the use of several institutions or a representative 



sample to support generalizablc results (Pallett, 1984). In addition, the majority of tilie studies do not 
include all of the major components that research has shown to be related to persistence (background 
cliaracterisHcs, initial and subsequent institutional and goal commitment, and soda! and academic 
integration). The fev.* studies that have included all of these components typiCv,lly used a limited sample 
of students from a single institution and obtained relatively low response rates. 

Tinto (1975) argued that his model was an institutional model, not a model 1 ir general use across 
all institutions. He emphasized the importance of thie i' iter actions of the individual student with the 
institution. This argument was supported by Bean (10tJ6), who stated that the heterogeneity of students 
and institutions "indicates that a single model of attrition will tend to work poorly in explaining the 
dropout process for individual students at particular institutions" (p. 49). These views support the practice 
of developing attrition models and measures of student /institution fit for specific institutions or types of 
institutions. 

Cope (1978) emphasized the need for an early iderdfication of potential dropouts, with the use 
of "readily ascertainable" characteristics of students (e.g., high school background, demographic variables), 
Lenning (1982) also supported the use of easily-obtained data, with the view that this type of data might 
provide comparable accuracy to other more costly and difficult measures. The data currenUy available 
from college admissions tests like the ACT Assessment are easily obtained, and may provide a means to 
identify potential dropouts. In addition, questiorinaires administered to students during the freshman year 
would provide relevant information to measure institutional and goal commitment and academic and 
social integration, which are important components of Tinto's model. 

Noble (1988) studied Tinto's model of attrition using ACT Assessment and ACT's Evaluation 
Survey Services survey data as predictors of freshman persistence. The data for the study consisted of 
matched ACT Assessment and Entering Student Survey records, ai,d course aedit and CPA data provided 
by 14 institutions. The results showed that, in combination, ACT student background information, 
Entering Student Survey data, cumulative CPA, and full- time /par I- time enrollment were moderately 
effective in predicting freshman persistence (median multiple R=!,52), However, a crossvalidation analysis 



at tliruL' inbtitutions rcveak'd a large reduction in prediction accuracy. Furthermore, linear regression was 
used tt) develop the predictit^n models. Linear regression is not well suited to modelling dichotomous 
variables. A more appropriate method involves the use of lo^'.<stic Regression. 

The present study was intended to overcome these limitations and problems. The variables 
included in the study emphasize those issues that are important elements of Tinto's model, but that many 
rL^search studies have found difficult to measure adequ a tely-stu dents' expLMZtations of the institution and 
ol thcmsefvL's, and M)Cial .md academic integration. In addition, tlu* data for the study were collected 
throughout the freshman year, during the critical time periouj for student persistence. Apprcximately 
thret»-f(iurths of all dropouts leave sometime during the freshman year (Tinto, 1987); most of these 
dn)pouts leave during the first six weeks of the fall term <Bla;iC, DeBuhr, & Martin, 1982^). Persistence 
studies typically lack inftjrmation that ^vould allow one to identify these at-risk students because they do 
not assess students early in tlie tL'rm. Furthermore, whereas most persistence studies are based on 
outcome data collected at one point in time, this study hivolved multiple data collection points occuring 
before ct»llege entry and thioughotu the freshrnan year. Because of the longitudinal nature of this study 
and the fact that data were collected at critical points in time, it was expected that the findings would be 
moa* comprehensive than those of previous studies. 

Tlie definition of dropout in this study included any student who left the institution during a 
specified time peritxl. Because the students were all entering freshmen and the study followed them only 
through the beginning of the sophomo**e year, it was assumed that any student who left during this time 
did not complete a degnv program. All dropouts, regardless of their reasons for leaving, are generally 
of interest to Institutions. It may be desirable, lu)wever, to use ii\fomnation collected prior to enrollment 
or early in the year regarding individual enmllment plans and goals in deciding whether a student is a 
candidate far intervention. 

Tlu» overall purpose of the study was to determine the validity of using students' ACT Assessment 
scores, background characteristics, perceptions of the college, expectations, and college credit and CPA 
information ft>r predicting college fR»shman persistence. Tl\e focus of this paper is on the utility of 
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perbistence modds> developed during the fiibt year of college, and on potential ways of using the results, 
in developing interventions for high-risk students, 

Tliis study was primarily designed as a statistical exploration and identification of variables that 
are highly related to student persistence. The anticipated result, however, m a practical one: the 
knowledge gained fmm the study will enable institutions to develop efficient methods for identifying 
high-risk students and recommending interventions for keeping them in school. 



Data 

The predictor variabletJ identified for the study were drawn from current research on persistence. 
They included the following, 

A. Ba ckground i nf orma tion 

1. Demographic characteristics (sex, race, age, etc.) 

2. Academic development /ability (ACT test scores, high school GPA, courses taken, 
etc.) 

3. Nature of high school preparation (self-reported adequacy, curriculum type, SES 
level of the school) 

4. Extracurricular participation 

5. Financial (ACT Assessmeiit and ACT Student Needs Analysis System) 

6. Family attitudes towards education (interest and perceived expectations of 
parents, financial support, parents' level of education) 

7. Academic and personal netds (needs for help with writing, study skills, personal 
counseling, etc.) 

8. Self-reported physical health 

9. Self-reported personality characteristics 

B. Initial commitment to institution 

1. Purpose for enrolling 

2. Institutional choice (was the institution their first choice, second, etc) 

3. Importance of selected institutional characteristics for attending the institution 
(social, academic reputation, physical characteristics) 

4. Full-time /part-time enrollment 

C. Initial and subsequent academic goal commitment 

1, Expected degree and strength of expectations 

2. Certainty of career aspirations 

3. Commitinent to and value placed on college education (academic motivation, 
academic relevance) 

4, Actual versus expected progress in reaching academic goals 
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5. 
ft. 



Saiisfaction with ac<idemic progrcbb and services 
Absenteeism 



D. 



Studcnt/ini>titutin - cadcmir til 



1. Docs tht! institution meet the academic expectations of the student 

2. Course enrollment, completion and grades 

3. Tvlced for remediation or advanced (honors) course work 

4. Perception of relationships with faculty, advisors, and staff 



t. 



Student /in htitution social fit 



1. 

2. 
3. 
4. 



Amount ut tri end ship, peer support 
Social relationships with faculty and staff. 
Cumforl and satisfaction with the env'imnmenl 
Extracurricular actiWties 



F. 



Student /institution financial fit 



1. 
2. 

3. 



Amount ot immediate family contribution required to meet expenses 
Hours per week spent working 
Loans required to meet expenses 



The criteritm variable was student persistence. Four separate time periods were examined: 
completion of the fall term, re-enrollment in the spring term, completion of the spiing term, and re- 
enn>llmenl in tlie following fall lenrip For institutions on the quarter system, re-enroUment and complehon 
were also examined for the winter term. 

Twcnty-lhree ACT user institutions were contacted in May, 1989 and invited to participate in the 
study. Of that number, six agreed to participate for all three years, beginning in Fall, 1989. One school 
withdrew from the study during the first lenn because of data collection difficulties. 

The data for the study were drawn from several sources: the ACT Assessment, the ACT 
Institutional Data Questionnaire, the Market Data Retrieval pub ic and private high school files, and three 
questionnaires developed specifically for this study. 

The ACT Assessment is a comprehensive evaluative, guidance, and placement program used by 
moa* than a million college-bound students each year. It consists of four academic tests, self-reported 
high school a>urse work and grades, the Student Profile Section (SPS), and the ACT hiteresl inventory. 
The SPS, which collects information related to family and high school background and preferences v^tU 
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regard U> cnllcj^echaracterisriCb, was Ihu souice of many background characteristics ilems, as well as early 
indicators (*f insritulional and goaJ commitmenl. 

The ACT Institutional Data Questionnaire <1DQ) collects descriptive informati()n about two- and 
four-year pi)stseconriary institutions; the information is used to develop the College Planning/ Search Book 
(ACT, lyyO). Tlie i.istrunrient includes information related to enrollment size, tuition, degrees offered, 
contn)l (public/ private), and selectivity. Itms from the questionnaire wen? used to develop the 
student /institution fit variables fur measuring institutional commitment. 

The Market Data Retrieval (MDR) files provide descriptive information from public and private 
secondary schools throughout the United States. The variables taken fn/m these files were per-pupil 
expenditure, availability of special education, and percent of students in the district with family inaimes* 
below the federal poverty level. 

Thrw survuy instruments were deveU>ped for the study: The Entering Freshman Survey, 
administered immediately before or after fall enniHrnenl; the Enmlled Freshman Survey, administered in 
the middle of the fall and spring terms; and the Withdrawing Student Survey, administered to students 
who withdrew. The Entering Freshman Survey was designed tL> assess students' initial perceived needs, 
expectations, and perceptions related to college in general and to the spt»cirtc college attended. The 
Enn^lled Fn.»shman Survey was designed to measure similar variables and issues, with particular emphasis 
c>n whether and how students' initial needs and expectations were being met. The Withdrawing Student 
Survey was also designed to measure students' perceptions and attitudes, and included students' reasons 
for leaving. This questionnaire was administered to students at the time they withdrew fmm college. 

The Entering and Enrolled survey forms were designed so that it was possible to compute 
discrepancies between a student's responses to comparable items on the two instruments. Discrepancies 
wen* computed by subtracting a student's response on an Enrolled Freshman Survey item ftxim hiscir her 
response on the comparable item fn)m the Entering Freshman Survey. When sprijig Enrolled Freshman 
Survey data became available, the discrepancies were recomputed using Entering and spring Enmlled 
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rL»:,ponbi'S. The objcclivu in using disca»pancii'b was to measuix? changes in btudunts' attitudeb, 
fxpc-ctationb, and goalb from thu time they entered college to the time period of interebt. 

Course ca»dit and grade information for each btudent wab obtained from the inbtitution in wliich 
the student was enrolled. Participating institutions were asked to provide the number of credit hours 
attempted, midterm btatus, number of credit hours earned, and CPA for each term the student was 
enrolled. 

Method 

Data Ceiiection 

The Entering Fa»shman Survey was administered to randomly selected entering freshmen at each 
inbtitution during the freshmen orientation /registration period or in intact classes during the first week 
of bchooi. The Enrolled Freshman Survey was administered shortly after fall midterm and again shortly 
after :*pring midterm. Tlie Withdrawing Student Survey was administered in exit interviews or by mail 
to students who left the college at any point between frebhman enrollment and re-enrollment at the 
beginning t)f the bophomore year, 

Institutions were also asked to pn)vide specific ei"irollment and completion information about their 
fre>hman class at several points during the freshman year. This informahon included name, social 
security number, credit hours enrolled, midterm performana*, credit hours earned, and CPA for the fall 
and spring term and fall 1990 re-enroUmcnt. Midterm information wab obtained from only those 
institutions that routinely collected it. 
Analysis 

PersistL'nce was modelled at four separate points in time: end of fall term, spring term re- 
enrollment, end of spring term, and iv-enrollment the following fall term. For institutions on the quarter 
system, winter term re-enroliment and completii>n of winter term were also modelled. Persistence through 
the end (if the fall or spring terms was examined in two ways: Initially, for each term, models wea» 
developed Ubingonly information a'ceived prior to the term in question and enrollment information from 
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the b'jRiniiing of that term. Subsequently, models were developed that also considered information 
received during the term of interest: the Enrolled Freshmen Survey and midterm performance dara. 

Several steps were necessary to reduce the number of predictors to a manageable set for each 
institution. First, simple correlations were computed between each predictor and the dichotomous 
persistence variable. Any variable that had a correlation coefficient of .10 or higher with perjsistena*, and 
that was statistically significant (p < .05), was considered for inclusion in the prediction model. Tlicse 
variables were then examined with respect to the number of respondents and aintent redundancy. Any 
variable for which there were relatively few observations or that was clearly redundant with other 
predictois was eliminated. The remaining predictors were then entered into a multiple linear itigression 
model of pf*rsistence. Collinearity diagnostics were examined, and variables with high variancc?- 
dea)mposition proportions at high condition numbers were dropped from the model (see Belsley, Kuh, 
and Welsch, 1980). The remaining variables were evaluated in terms of their contribution to and 
statistical significance Cp ^ .05). 

Multiple linear regression is the most conunonly used statistical method for predicting outcomes. 
Htiwever, because linear regression assumes that the criterion, or dependent variable, has multiple values 
that are interval in scale, it is not the most appropriate method for modelling a dichotomous criterion 
variable such as persistence. Because a dichotomous criterion variable is bounded, a linear regression 
might result in impossible values. Although polynomial models might be constructed to fit the data, they 
would be complex and difficult to interpret. 

Another commonly used method for this type of study is discriminant function analysis- Tliis 
method uses one or more continuous/ or metric, predictor variables and a categorical, or nonmetric, 
criterion variable. In discriminant function analysis, individuals are assigned to a group, or categoiy, on 
the basis of their weighted scores on the predictor variables. It could be argued that discriminant function 
analysis would be a suitable method for analyzing these data. However, discriminant function analysis 
assumes a linear relationship between predictors and criterion; when the criterion is a dichotomous 
variable, the problems of linear regression also apply here. In fact, it should be noted that in the case of 
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a dk'hutomdiis crilmon variiUMo. ^uch pLTMijl/dnipiiiit, dibcriminant amlyslb is uquivalLmt In linc»ar 
^u^a's^i(^n iTalbUdka, 1^*71). 

An ,ilH'r!\^ti\'L' inL»ll\(id, h^fti^lic rcgR^su»n, was dt-vuloped -of ic.ill y to deal with dicholomdUb 
dupendunl variablub. Lngibtic rcgrcb^ion asbumcb curvilinear relationbhip^ between the independent and 
dependent variableb; hence, a predictor vari^ible's influence on the outcome is more likely to be reflected 
in the model if curvilineariw is prt*v.Lnit. Finally, lofpsttc a»gn.*5siun computer n>u tines directly estimate 
the proKibilitv o\ the nulLimu- \.>n.flil,' (in thl^ ta^y, perM>tenre) li»r e.ith >ludent on the basis of his or 
her values on the predictor variables. This is very practical and useful information wKii the goal is to 
identity students who are likely to drop out. 

The Logistic Model 

The predictiim model for loRI^tic a'ga'ssitm looks similar to the prediction model for linear 
regR'ssion; the same terms <ippear ni both mcjdels, but have different meanings, fn a linear regression 
model, 

Y =^ a., ^ a^ X, + .... + a„ x„, 
Y is tlie predicted value on the criterion variable, x„ x^. are the observed values on the padictor 
Viiritibles, a„ ib, the inlera»pt term, and a,, a„ are the regression coefficients associated with the predictor 
variables. For a one-unit change in x, thea* is an expected change in the criterion variable equal to the 
size of n,. 

A logistic rc^gression prediction model is formulated as 

Index =^ I '^i ^n' 
where the criterion variable is a dichtitomous variable, such as success /failure on some criterion, and the 
predictors are metric variables. The pn.bability of persistence is: 

p- l/(Ue)- Index, 

where e = 2.718 is the base of natural logarithms. Index is the logarithm of p/d-p), the odds of 

persistence. The symk)ls x, , again denote the 4)bserved values of the predictor variables, and ao 

denotes the intercept In the logistic pa'diclion model, a„ are called the regression coefficiei\ls for 
the predictor variaWes; however, since the Index represents the log odds of persistence, not the predicted 
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value* (if tho critcTiun, a, roprobcnlb the degree cif change in the hig oddb of persiblence (the h\dex) given 
a tine- unit change in x^. 

The independent variables kom Ihe final linear regre.sbion nu)delb were entered inlt> a logiblic 
regre^biiim analysis io predict persiblence. Any cif Ihebe predictors that were nt)l found In be blatibttcally 
significant at p < .05 were removed from the model. 

Results 

Table 1 pn>videb bome dv»bc/iplive information about the inblitutionb participating in the btudy. 
Fo ;r of the five bcbools were 4-ycar inbtilution^. The enrollmentb of the five bchoolb ranged from below 
5,0(10 to over 15,1)01), and four of the bchoolb were public inblitutionb. Table 2 pRivideb belectcnJ debcriplive 
data for the frebhman bample at each of Ihe fivo bchoolb. Tlie ratio of femslcb to males ranged from 50:50 
lo 6S;32. Virtually all of the freshmen at .ill five bchoolb were below the age of 22; a large majority were 
IH or younger. At four of the five bchoolb, ^[)7f or more of the freshman were while, and almost all were 
unmarried. The percentage of btudentb who had a high bchoi>l CPA of 2.50 or lower ranged fRim V7r lo 
^y/f. Fifteen percent to 43*^ of the freshnien'b fathers had gone no farther than liigh bchool, and the 
pera»ntageb were quite bimilar for the mothers. 

Table 3 bhowb the total numbers of students in the bampleb, by bchool and time period, and the 
actual numberb of dropouts. Table 4, in contrast, showb the number of btudent recordb included in the 
final model for each inbtilution and the nuniber of btudentb who dropped out at each time period. Ab 
shown by comparing the numbers in Table 3 to those in Tabic 4, there was a ct)nsiderable amount of data 
loss over time. The percent of data loiit throughout the first year ranged from 26 to 75; the median percent 
loss was 63. As a result, it was not possible to develop prediction models for those time periods. Tliis 
was due to a number of causes: Many students failed to respond to the Enrolled Freshman Survey during 
the fall and /or spring terms. In addition, many of the questionnaires received did not contain a)inplele 
data; hence, varying amounts of data were missing for many of the predictors. Finally, several schools 
identified no students dropping out a » ?rtain penods. 
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Results for the final logistic regressitin models are shuwn in Tabltis 5 through 9. Each table 
provides the predictor variables for cacn institution, along with their associated regression estimates, for 
a given time period. First-term pLTsistencc was motlellcd in two ways: initially, only prc-cnrollment 
variables were used to predict persistence (shown in Table 5); subsequently, fail Enrolled Freshman Survey 
aiid enrollment variables were also included (shown in Table 6). As a result of including the additional 
variables, there was a certain amount of data loss (as shown in Table *♦); therefore models could ncJt be 
developed tor all schools. Tlie same approach was attempted in modelling spring persistence; howevcT, 
the extent of data loss was such that a second model could be developed for none of the five schools. 

Although relatively few individual predictor variables were present in more than one model, most 
of the significant predictt)rs a)uld be gn»uped in terms of the categories described in the Data section. 
The most frequently occurring categories at all time periods were the following: goal commitment (e.g., 
academic goal at this institiition, number of cofdits, expect to complete freshman year, certainty of career 
choice), institutional commitment (e.g., I like this college, satisfaction with academic reputation of college, 
satisfaction with availability of major), and academic fit/integratit)n (e.g., number of credits dropped, 
number of credits earned, CPA, availability of courses wanted, use of academic advising). Higli school 
preparation /background (e.g., number of math courses taken, ACT Mathematics score, high school CPA, 
high school per pupil expenditure) was also significant for most time periods. Plans to work while in 
school was important in predic ting first-term persistence at two schools, but was not a significant predictor 
for later time periods. Anotlic»r issue that appeared significant only to first-term persistence was state 
residency classification, which is related to tuition. Btjth plans to work and nonresidence stains might be 
interpa'ted as indicators of financial stress. Social fit indicators (e.g., use of ct)Ilegc-sponsored or off- 
campus activities and programs) were significant predictors in a few instances. Finally, a few variables 
related to personality wea» significant (like school, enjoy being with pcH)pIe socially). 

The discrepancy variables examine the points in Tinto's model at which expectations meet actual 
experiences, and goal and institutional commitments are re-evaluated. In several of the models, 
discrepancies between responses on Entering Freshman Survey items and corresponding items on die fall 
or spring Enrolled Freshman Survey were statistically significant pa'd letters. For these items, a higher 
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valur <Hi tlic laUT burvey than on Uie earlier survey crented negfltive valuet. on the discrepancy varinbleb. 
G^nversely, if the later response wab lower khan the initial response, the discrepancy was pnsitive. 

Some of the relationships between survey response discrepancies and persistt?nce were not 
intuitively obvious. For example, as shown in Table 9, the discrepancy between studentb' initial 
satisfaction and later satisfaction with employment opportunities was positively related to persistence. 
This suggeshi that if students' satisfaction witli thuir employment opportunities decrers^ed over time, they 
were more likely to persist. If their satisfaction increased, they were more likely to drop out, A pobsible 
explanation for this is that students who find that they have better emplo^ ment opportunities than they 
expected will be likely to drop out of school to work, or fail as a result of working too many 'lours while 
in school. 

In Table 7, the discrepancy between initial and later concern about having to drop clab^e^ due k) 
poor grades was a statistically significant predictor for Institution 5. Students who became more 
concerned appeared more likely to persist, while students who became less concer ed were more likely 
to not return the second term. It is possible that the students who indicated increased concern had a 
realistic sense of the challenges they faced, and consequently worked harder to perform well. On the 
other hand, students who expressed less concern may have bucome apathetic about school or unrealistic 
abt>ut expectations and consequently dropped out. 

The results support Tinto's assertion that models of student pi»rsistence should be institution- 
specific. For example. Table 9 shows that in predicting second year re-enrollment measures of academic 
fit were important at three of the five institutions, social fit indicators were significant at two institutions, 
and goal commitment indicators were important ar two of the five institutions. This, as well as the fact 
that no specific variable appeared in more than one model (in Table 9), indicates that characteristics of 
institutions uniquely interact with characteristics of their particular student populations. 

The results are consistent with previous research using Tinto's model. Other studies have found 
that goal and institutional commitment arc key predictc<rs of student persistence (e.g., Hackman &. 
Dysinger, 1970; Cope & Hannah, 1975; Noble, 1988; Webb, 1989). Furthermore, other research has found 
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academic fit to be a more salient predictor of persistence than social fit (e.g., Munro, 1981; Halpin, 1990). 



Practical Utility of Logistic Regression Results 

Using the predicted probabilities of success from a logistic regression model, prediction accuracy 
can be examined by constructing a simple derision table such as the one shown below: 



Actual outcome 


Predicted ou rome 


Below aitical vaUe 


Above critical value 


Purbist 


A 


B 


Dri>p out 


C 


D 



It is possiible to identify a critical point on the scale of obtained index values; a student whose 
index value is at or above the critical point would be predicted to persist, and a student whose index 
value ib below tl^e critical point would be expected to drop out. Observations are categorized into one 
of tht* ftiur possible outcomes. B is the number of "true positives", that is, the number of students who 
were expected to persist and actually did; C is the number of "true negatives — the students who were 
predicted to drop out and actually did. A represents the false negatives and D represents the false 
positives; these are the groups for which incorrect decisions were made. B + C represents the number of 
students for whom correct decisions were made; when presented as a proportion of ilip lotaJ group this 
is also referred to as the accuracy rate. 

Retention Programs: Identification and Intervention with High-Risk Students 
The information provided by logistic regression models can be used to assist in identifying high- 
risk students and designing intervention strategies to address their needs. The proportion high-risk, 
probability of retention, and accuracy rale vaiues can be used, with the Index scale, to set critical points 
for identifying high-risk students. The logistic regression model provides flie variables most strongly 
associated with persistence for each institution. High-risk students' performance or responses on these 
variables can be used in identify ij'q areas where interventions can be focused. 
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Before establishing a retention progi-am for identifying and intervening with high-risk students, 
an institution must consider several factors not addressed by statistical analysis: students attend and drop 
out of college for many reasons. According to Tinto C1987), a student dropout should be considered a 
failure only if both the student and the institution fail to meet their goals. If the student's intent is to 
attend school for a year, and then transfer, this does not necessarily make him a "failure" at the school. 
A goal of some institutions may be to minimize student transfer; others may encourage it (e.g., two-year 
colleges). Further, many variables related to student persistence may not be under the control of 
institutions (e.g., race, gender, health status). Consequently, it may not be feasible or cost-effective to 
attempt intervention with some students. Each institution must detern^ine, before implementing a 
retention program, what its goals are with regard to student persistence, the types of dropouts with which 
it wishes to intei^ene, and the resources to be made avaiUible to the program. 
An Example 

The logistic regression results for Institution 2 across all four persistence time periods are shown 
in Table 10. For each time period, the sigiuficant (p < .05) predictor variables are identified, with their 
corresponding regression weights. As shown in the table, the significant predictors were not constant over 
time; in fact, the student's goal in attending the institution was the only recurring variable. 

The regression weights describe the direction and strength of the relationship between each 
variable and student persistence. For example, the i\uqaber of hours a student planned to work was 
negatively related to fall term persistence at this school; as the number of hours a student planned to 
work increased, probability of dropping out increased. For fall term persistence based on pre-enrollment 
and fall survey data, the discrepancy between expected completion of the freshman year at entry and at 
mid-fall term was positively associated with persistence. Increased expectations of completion were 
associated with higher probabilities of staying in school. Per-pupil expenditure for the high school 
attended was negatively associated with spring term persistence; students who attended wealthier high 
schools were more likely to drop out. Further, students who were more satisfied with job opportunities 
during the mid-spring term than at the beginning of the year were less likely to persist through the 
beginning of their sophomore year. 
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In summary, institutional and goal commitment variables initially tended to be significant 
predictor variables for iitudents at this institution. For later time periods, however, academic and social 
student/institutional fit variables predominated, though goal and institutional commitment variables were 
still present. 

Figures 1 through 4 illustrate the prediction accuracy of these models using three indicators of 
prediction accura y. Each figure corresponds to one of the four time periods for the institution whose 
prediction models are summarized in Table 10. The horizontal axis represents the Index scale, computed 
from the relevant logistic regression model. The vertical axis represents a probability, or proportion, 
associated with each of the three indicators shown in the Figure. The probabLlily of retention curve 
indicates the probability of persisting in college for a student with a given Index value. As the Judex 
value increases, the pmbabilily of retention increases to its maximum value. The prcoability of retention 
is always 50% for an Index ^^alue of 0. 

The proportion high risk and accuracy rate curves can be used in setting a criticiil point on the 
Index scale. For each point on the Index scale, the curves illustrate the expected results of using that 
Index value as a cutoff point for identifying high-risk students. The proportion high-risk curve indicates 
the proportion of students who would be identified as high-risk for a giv^n cutoff on the index scale. Tliis 
curve increases as the cutoff value increases; at the highest Index value all students would be flagged as 
high-risk. The accuracy rate curve indicates the proportion of students correctly classified for any given 
cutoff point. 

From Tab'e4, it can be seen that the actual dropout rate during the first term at Institution 2 was 
about 10% of the sample. Hypothetically, tliis institution could choose to intervene with only 5% of its 
students. The costs of intervention would need to be weighed against the benefits to determine the best 
target percentage, however. Figure 1 can be used to determine the expected results of such a decision. 
Drawing a line across from .05 on the vertical axis to the proportion high-risk curve and then down to 
the Index scale, the resulting Index cutoff value would be 1. This would mean that a student with an 
Index value at or below 1 would be identified as high-risk. The probability of retention curve shows that 
a student with an Index value at or below 1 would have about a 747o or lower probability of staying in 
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school. Correspondingly, the accuracy rale associated wilh an hidcx value of 1 is aboul .9, meaning that 
about 90% of the students would be correctly identified. Note that the accuracy rate begins to decrease 
at this point, as the Index is increased; if the Index cutoff were set at 2, the accuracy rate would be .8. 
The greatest prediction accuracy would be achieved when the Index value associated with the maximum 
<iccuracy rate is used. 

Similar decisions cati be made about the otlier rime periods for institution 2, as shown in Figures 
2, 3 and 4. For first term persistence (using pre-enrollment and first term data), as shown in Figuie 2, an 
Index cutoff value of 1 v/ould identify somewhat less than 5% of the students as liigh-risk, wilh a 
probability of about 75% of staying in school, and an accuracy rate of .98, For spring term completion and 
fall re-eiirollment, slightly lower index cntoH values might be identified, due to the relarively gradual 
decline in the accuracy rate curve. A cutoff value of .5 might be used to maximize the accuracy rate while 
still targeting a relarively small pR)portion of high-risk students. Approximately 2.5% of the students 
would be identified as high-risk in the spring tenn, and 8% would be identified as higlvrisk for fall re- 
enrollment persistence. 

Targeting Correlates for Developing Intervention Strategies 

Tables 11 through 14 list selected students from each time period that were flagged as high-risk 
using the Index values suggested in the previous secrion. The Index value for each student is given, along 
with his or her responses or performance on the significant predictor variables. 

The results shown in Table 11 highlight several areas for potential intervention during the fall 
term. In developing interventions, schools may want to look at the intentions of high-risk students with 
regard to educational goals. For some students who plan to take only one or two courses then leave, it 
may not be worthwhile to intervene. For example, of the 41 high-risk students (only a subset of the 41 
are shown) at Institution 2, four attended the college for self-improvement, to take a few courses, or with 
the intent of transferring. Depending on the insriturion's definition of dropout, these students might or 
might not be targeted for intervention. 

Of the 41 students identified as higlvrisk at Institution 2 during the fall term, 18 (44%) had no 
definite purpose in mind for attending the school. An investigarion of t]ie career and educarional 
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counbding provided by the school, the extent to which information about these services is made available 
to students*, and the form in which the information is given to students (e.g., orientation, advising, written 
materials) might provide additional information for future revision or modification. It is possible that, 
with additional education and career information and guidance, these students might have persisted. 
Further support fijr this investigation is found by examining the mathematics course work and tlie 
student's gi>al in attending the school. Twenty-five of the 41 students had taken 1 or fewer mathematics 
coarseii in high school. Further, of the 14 student? who intended to obtain a Bachelor degree, 1 1 had 
taken no mathematics course work in high school. Nearly all of these 11 students were eiiroUed fu[I-time 
and were ;-,^nning to work 20 to 29 hours weekly, In fact, for tlie total group, all but 4 were enrolled full- 
time, and all but 7 were planning to work at least 20 hours each week. 

The results for fall persistence using pre*enrollment and /all survey data, as shown in Table 12, 
were less clear for the purpt^so of identifying possible student interventions. Only two of the 27 identified 
studc^ntb (of which a sample is shown) were planning lo transfer, six had no definite plans, and three were 
taking courses for self-impnjvement. All others were planning to complete an Associate or Bachelor's 
degree. Of the 27 students, 16 changed their expectation of completing their freshman year from "yes" 
to "no" between entry .Jid mid-fall term. Four students indicated at midterm that they intended to 
complete the year, but did not persist. Using this information as a preliminary indicator, other variables 
such as midterm grades and satisfaction with the institution could be examined to further identify factors 
related to these students' dropout behavior. 

For spring term persistence, a key variable related to student persistence was the difference 
between credit hours enrolled and credit hours earned in the fall term. As shown in TaWe 13, the 
studei\ts identified as high-risk typically lost over 9 credit hours in the fall term, and their CPAs from 
the courses ft)r which they did ean\ credit tended to be less than 1.00 (less than a "D"). Only two students 
had no definite purpose in attending; all others were planning to acliieve certification (1) or to complete 
an Associate (2) or Bachelor's (8) degree. Investigating the hours the students wea» working, the credit 
hours enrolled, the college courses in which they were enrolled, and their high school course work would 
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provide further informalion about why the students were failing. Additional guidancc/i^d vising about 
appropriate college course work and academic support might reduce the potential for shjdent fc^ilure. 

The list of high-risk students for re-enrollment in the sophomore year is provided in Table 14. 
Two findings are clear; over 50% of these students were not degree-seeking students, and these students' 
responses about liking the college tended to be negative. Eight of these students were p!anni?ig to 
transfer, and 6 had no definite plans in attending the school. The undecided students might benefit from 
educational and career counseUng, as noted in discussions of earlier time periods. In addition, examining 
the students' responses regarding their satisfaction with specific aspects of the college would assist in 
identifying potential areas needing improvement and intervention. 

In conclusion, the results from this particular institution illustrate a few important p(nntb. it is 
apparent tliat later dropouts are more likely to be academic failures, as seen in the spring term persistence 
data, or to have entered the institution with shorter-term goals, as seen in the sophomore re-enrollment 
data. The earlier dropouts, of which there are generally more, may be the more difficult students to 
identify as at-risk. In addition, they may offer more opportunities for successful interventions. 

Discussion 

TTiere were several problems that may have hindered the iiiterpretability of this study. Future 
studies of this type should take care to address these problems. 

Because of the fact that there were multiple data collection points tlvoughout the year, there was, 
inevitably, a certain amount of data loss. Each time a survey was administered, some portion of the 
original sample failed to respond. Tliis amount varied, depending on the administration methods that 
each school employed. For example, although this was strcngly discouraged, some schools administered 
the Enrolled Freshman Survey by mail, in otlier cases, !?chools administered the surveys in freshman 
classes, but simply did not have any particular classes in which most freshmen enroll; this was a more 
severe problem in the spring term. 

Another source of data loss resulted from the fact that questionnaire respondents often leave some 
items blank, perhaps due to oversight, fatigue/boredom, or uncertainty about how to answer an item. 
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In multiple linear or logistic regression procedure, such as those used in this study, a case-wise deletion 
process is typically used to eliminate missing data. It is possible to use some method of estimatmg values 
for missing data in incomplete cases, which would alleviate this problem, (t should be noted, however, 
thai the most straightforward methods of value estimation may result in biased regression parameters; 
more appropriate methods would involve iterative techniques that prove lo be expensive and time 
consuming (Anderson, Basilevsky, and Hum, 1983). 

An unexpected statistical difficulty was the relaiively low proporrions of students who withdrew. 
The attrition rates were, in most cases, significantly lower than were expected. At some institutions, the 
intercept-only model gave a high probability of persistence, and there was very little to gain by adding 
predictor variables to the model. In such cases, there were greater opportunities for chance factors to 
influence results. It is possible that the first-term attrition rates in this study were artificially low because 
students who withdrew very early in the term were not reflected on the initial enrollment files. One oftlie 
institutions in the study was able to confirm tliat the enrollment files were only accessible after the first 
few weeks of class had passed, thereby eliminating any record of students who had already withdrawn. 
In addition, it was noted that percent of data loss due to nonrcsponse or incomplete surveys tended to 
disproportionately reduce the number of dropouts. That is, students who were likely to drop out were 
also less likely lo complete the surveys. In future persistence studies, some of the persistence periods 
could be combined in order to increase the number of dropouts in eadi period. 

Finally, there were many predictor variables included in the study. Because of the large number 
of variables included in the study, it was unlikely that a particular set of variables would consistently 
emerge as significant predictors across all institutions. The exploratory nature of the research made it 
desirable lo include variables addressing all constructs tl^at past research had shown to be related to 
retention. Furthermore, the study specifically addressed the question of how dropouts at different periods 
during the freshman year differ; consequently, it was important to collect data at several different points 
in time. In the follow-up study currently underway, factor analyses have been performed with the survey 
variables with the goal of stabilizing results by replacing individual variables with composite variables. 
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Table 1. Description of Participating Institutions. 



School 


Region 


Type 


Control 


Admissions 
policy 


EnroHment 
range 


Number 
of 

students 
studied 


Sampling 


Calendar 
type 


1 


Midwest 


4 yr. 


Public 


Selective 


over 15,000 


2100 


Random 
classes 


S 


2 


Mtn/Plns 


4 yr. 


Public 


Liberal 


5-15,000 


1400 


Other 


S 


3 


East 


2yr. 


Public 


Open 


1-5,000 


600 


Random 


S 


4 


Midwest 


4 yr. 


Private 


Selective 


1-5,000 


450 


Whole 


s 


5 


West 


4 yr. 


Public 


Selective 


1-5,000 


1400 


Other 


Q 



Note: The "other" category of sampling refers to representative samplet* of students enrolled in specific 
classes. 
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2, liack^round and Lducdtional L'h«r.icterisdcs of Students by School (in percent). 





School 


Characteristics 


1 


2 


3 


4 


5 


Malcb 


50 


40 


39 


37 


32 


Afic = 17-18 


H4 


83 


88 


93 


87 


Agv = 22 or older 


2 


1 


1 


<1 


11 


While 


40 




91 


97 


S4 




4 


1 


2 


<1 


3 


Abian 


2 


<1 


<1 


<1 


h 


Single 


48 


^9 


99 


99 


100 


High school CPA of 25(1 or Icsb 


7 


15 


33 


3 


1 


Fathcr'b cd Icvel-HS diplorrui or \vbb 


35 


40 


43 


26 


15 


Mother's cd lcvcl-H5 diphnrici or lci,.s 


37 




39 


25 


15 
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Table 3. Sample Size and Number of Dropouts for Each Retention Period, by Institution 



Retention period 


Institution 1 


Institution 2 


Institution 3 


Institution 4 


Institution 5 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


Fall persistence 


1508 


25 


994 


95 


478 


73 


439 


22 


333 


6 


Spring re-cnnillment 


14K3 


76 


899 


1A7 


4()5 


9 


417 


84 


327 


28 


Spring pcTbibtence 


14()7 


23 


732 


4 


396 


y 


333 


3 


299 


3 


Sophomore re-enn>llment 


1384 


158 


879 


151 


387 


15 


330 


48 


29f) 


36 



2S 
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Table 4. Sample Size and Number of Dropouts for Each Prediction Model, Ijy Institution 



Retentionjeriod 


Institution 1 


Instihition 2 


Institution 3 


Institution 4 


Institution 5 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


N 


Dropouts 


Fall porsibtencc (using 
pro-enrt)llment data only) 


1293 


13 


m 


84 


417 


62 


9U1 


79 


328 


5 


Fall poriiitence (ufaing 
pro-enrollment and fall 
cnn)Umciit data) 


937 


7 


751 


42 


244 


22 


236 


5 




(1 


Spring rc-L'nrol linL'n t 


g4l) 


42 




0 


222 


7 


226 


29 




14 


Spring peri.i>tc*nce 


W2 


16 


7H1 


14 


396 


9 




0 




n 


SnphoniDn: rc-onruUmynt 


412 


2U 


335 


52 


307 


11 






146 


17 



•For inbtitution 4, i>i)phomorL» rt.'-i.'nn)llment infurmaticm was. not available. 
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Table 5. Logistic Regression Models for Predicting First-term Retention, by Institution, 

(Pre-enrollment Data) 







Regression 


Institution 


Predictor variable 


weight 


1 


Credit hours enrolled 


0.51 




High school extracurricular activities 


U.38 




Credit hours x extracurricular activities 




2 


Number of math courses taken in high school 


0.41 




Goal in attending this school 


0.14 




Number of hours plan to work 


f\ At 




FuU- time/ part- time enroilment 


2.26 


3 


Enjoy school 


1.96 




Number of hours plan to work 


-0.30 




Goal at this school x credit hours enrolled 


0.02 




Glial at this school x satisfaction with academic reputation 


0.09 




Goal at this school x enjoy school 


-0.25 


4 


Credit hours ennilled 


0.28 




ACT Mathematics score 


0.09 




High school athletic acQimplishments 


0.13 


5 


- High school CPA 


1.52 




Importance of beauty of campus/ buildings 


1.96 
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Table 6. Logistic Regression Models for Predicting First-term Retention, by Institution 

(Prc-cnrollmcnl and Fall Enrolled Data) 



Institution 


Predictor variable 


Regression 
weight 


1 


Residency classification 
- Importance of entrance RHrjuiremcnts 
*- Does college offer all courses students want 

(mid-fall) 


-3.18 
-1.46 
-1 .97 


2 


ExpuLl to aimpleU' treshmrUi year (at I'ntry) 

- Diijcrepancy: expi'd tu complete freshman year 
(entry minub mid -fall term) 

- College located in home blMc 


-4.31 
2.4K 

1.80 


3 


- Satisfaction with availability of a particular 
major (mid-fall) 

- Discrepancy: expected gnides minus actual fall 
grades 

- Discrepancy: expect to complete fR»shman year 
(entry minus mid -fa 11 term) 


.47 
1.54 


4 


- Use of college-sponsored social activities 

- Discrepancy: find a stimuJating intellectual 
atmosphere (entry minus mid-fall term) 


3.05 
-1.1 1) 



*Survey ilem was reverse coded; higher values assigni»d to negative responses 
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Table 7. Logistic Regression Models for Predicting Second-term Re-enrollment, by Institution 







1 

Regression 


Institution 


Predictor variable 


weight 


1 


- Importance of facilities for the handicapped 


-,82 




(mid-fall) 






- 1 like attending this college (mid-fall) 


.64 




*- Expect to complete freshman year (mid-fall) 


-2,(16 




- Fail CPA 


1.02 


3 


- Satisfaction with recognition of prior credit 


1.74 




earned (at entry) 






- Certainty of career choice (mid-fall) 


-4.35 




- Number of crcditii dropped in fall 


-,59 


4 


- Importance of recognition of prior credit 


.92 




earned (at entry) 






- Use of academic advising (nrsid-fall) 


-1,35 




Use of credit'by-exam programs (mid-fall) 


1,73 




- Number of hours spent studying (mid -fall) 


-.68 




Number of credits earned in fall 


-,21 


5 


- Discrepancy: concern about having to drop 


-1,11 




classes due to poor grades (entry minus mid- 






fall term) 






- Library services /facilities will be sufficient (at 


1.02 




entry) 






Enjoy being with people socially (at entry) 


1.01 




- Importance of recognition of prior ca»dit 


1.39 




earned (mid-fail) 






Does college offer all courses student wants 


-1,74 




(mid-fall) 






- Number of credits dropped in fall 


-,54 



*Survcy item was reverse coded; higher values assigned to negative responses. 



35 



29 

Table 8. Logistic Regression Models for Predicting Second-term Retention, by Institution 



Institution 


Predictor variable 


Regression 
weight 


1 


- Satisfaction with the coUcgo (mid-fall) 

*- Expect to ajmplote frcbhman year (mid- 
fall) 

- Number of clabbcb mibbod per week (mid-fall) 


.HI 
-I.IK 

-1.24 


2 


- Per pupil I'xpcnditiirc lor high bchoi)! 
attend uU 

- Credit ht>urb druppc*d in fall tenn 

- Fall CPA 


-.n 

-.24 
1.14 


3 


- Number of credib dropped in fall tc-mi 


-,71 



•Survey item wa:* rovcTse ci)ded; higher values absigned to negative rebponi>eb. 
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Table 9. Logistic Regression Models for Predicting Sophomore Re-enionmen^ by Institution. 







Regression 


Institution 


Predictor variable 


weight 


1 


- Discrepancy: satisfaction with academic 


,92 




reputation (entry n\inus mid -spring 






term) 






- Use of culharal programs (mid-spring) 


M6 




- Sprins GPA 


.66 




- Discrepancy: satis/action with 


.52 




opportunities for employment (entry 






minus mid-spring term) 






- Discrepancy: expected grades minus 


-.66 




self-reported grades mid-spring term 






- Goal in attending this school spring term 


.24 




- 1 like attending this school (mid spring) 


,95 


3 


- Discrepancy: highest education level 


-1.06 




c'vnectod (entrv minus mid-sorinff tprm) 




4 


- Discrepancy: importance of entrance 


-1.48 




requirements (entry minus mid-spring 






term) 






- Concern about having to drop out 


-143 




because of poor grades (mid-fall) 






- Need help with study skills (BPS) 


-1.56 


5 


- Opportunities to participate in off- 


1.10 




campus cultural/recreational activities 






(at entry) 






- Availability of courses you want at times 


-1,09 




you can take them (mid-fall) 
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Table 10, Persistence Models for a Particular Institution, by Persistence Period 



Persistence period / predictor variables 


Significant predictor variable 


Regression 
weight 


Fall term / prc-enrollment variables 
only 


Number of mathematics taken in high school 


.41 


Goal in attending this school 


.14 


Number of hours plan to work 


-,41 


Full-time /part-time enrollment 




Fall tfrm / pru-t;nri>llmcnl and fall 
Enrolled Survey variablc-s 


Expect to complete freshman year (at entry) 


-4.33 


Discrepancy: expect to complete freshman year 
(entry minus mid-fall term) 


2.4« 


College located in home state 


1.80 


Spring term / all fall variables 
Cinduding credit hours earned and 
CPA) 


Per pupil expenditure for high school attended 


-.13 


Credit hours dropped in fall term 


-.24 


Fall CPA 


1.14 


Rc-enrollment fall form sophomore 
yoar / 

all variables (spring Enrolled Survey 
variableb) 


Discrepancy; satisfaction with opportunities 
for employment (entry minus mid-spring term) 


.52 


Discrepancy: expected grades minus self- 
reported grades mid-spring term 


-.66 


Goal in attending this school spring term 


.24 


I like attending this school 


.95 
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Table 11. Selected Variables Related to First Term Persistence for Selected Students (Pre-eiirollment Variables Only) 



Student 


Index 


Probability 
of 

persistence 


Number of 
mathematics courses 
taken in high school 


Goal in attending 
this school 


Number of hours 
plan to work 


Full/part time 
enrollment 


1 

I 




Ai. 


-J 


No plcinij 


4(1 or m^ire 


Part 


2 


_1 (1 


27 


\ 




20-29 


Part 




-H h 




2 


LJ«Lllt.lLJr J LiCk'n.L 


20-29 


Part 


4 


-0,1 


,48 


(] 




20-29 


Full 


5 


0,1) 


.50 


{] 


l?acht.*lor'& dvj^njc 


40 or mt^re 


Full 


^ 


0.3 


.57 


1 




20-29 


Full 


7 


0.3 


.57 


0 


No plans 


10-19 


Full 


H 


(;.4 


.ftO 


0 


BacbL'lor'b dcgrou 


30-39 


Full 


9 


05 


.62 


0 


A&bociate degree 


20-29 


Full 


10 


0.6 


.65 


3 


Few ctJursc'b 


40 or more 


Full 


1 1 


0.7 


.67 


{) 


Transfer 


20-29 


Full 


12 


0.7 


.67 


2 


No plans 


20-29 


Full 


13 


().« 


.69 


0 


BaclK'lor's dcgrct* 


20-29 


Full 


14 


().« 


.69 


1 


Sdf-improvcmcnl 


10-19 


Full 


15 


0.8 


.69 


1 


Vocationhl- 
tochnical 


20-29 


Full 


1ft 


1.0 


.73 


2 


AbsdCiatc degree 


.^0-39 


Full 


17 


l.l) 


.73 


0 


Ab&ociate degree 


10-19 


Full 



40 
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Table 12. Selected Variables Related to Fall Term Persistence for Seiected Students (P re-enrollment and Enrolled Survey Variables) 



Student 


Index 


Probability of 
persistence 


Expect to complete 
freshmen year 
(at entry) 


Expect to complete 
freshman year (mid 'fall 
term) 


Discrepancy 
(entry - mid- 
fall) 


College in 
home state 


1 


-2.9 


.05 


Undecided 


No 


-1 


Yob 


2 


-2.9 


.05 


Yes 


No 


-2 


No 


3 


-1.0 


.27 


Yes 


No 


-2 


Yes 


4 


-0.4 


.40 


Undcddud 


Undecided 


0 


Yet, 


5 


H.2 


.55 


No 


Yet, 


2 


Yet, 


6 


0.3 


.57 


Undecided 


Yc^ 


1 


No 
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Table 13. SeJected Variables Related to Spring Term Persistence for Selected Students (All Fall Variiibles) 



Student 


Index 


Probability of 
persistence 


HS per-pupLl 
expenditure 


Number of credits 
dropped in fall 
term 


Fall CPA 


1 


-2.1 


.11 


$36l}l}-$379y.»^4 




0.0 


2 


-l.« 


.44 


$32l)()-$3399.9M 


\b 




3 


-1.7 


.15 


$34lH)-$3549.44 


15 


(U) 


4 


-1.6 


.17 


$26t}(}-$279y,44 


17 


0,0 


5 


-1.2 


.23 


or inurL^ 




0.7 




4).'S 


.38 


$44{)n-$4fi99.y9 


12 


1.0 


7 


-11.4 


.41) 


$4()()()-$-17yy,y9 




{\h 


H 


-l).3 


.43 


$3()lX)-$3199.4y 


12 


0.37 


y 




.50 


%mm ur more 


fi 


1.10 


m 


U.l 


.52 


$36(K)-$374M.y4 


10 


0.66 


n 


(Kl 


.52 


$2hlK}-$27MM.99 


11 


0.23 


12 


i)A 


.M 


$26()(l-$2749.49 


11 


O.Sll 


n 


D.S 


hi 


$32(l()-$3344.94 




0.5.1 



4E 
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re 14. Selected Variables Related to R«-enrolImcnt, Sophomore Year for Selected Students (All Variables) 



Student 


Index 


Probability 
of 

persistence 


Opportunities for 
employment (SA-SD 
at entry)* 


Opporlvntties for 
employment (SA— SD 
mid-spring term)* 


Discrepancy 
(entry ■ mid- 
spring) 


Expected 
grades 


Spring 
midterm self- 
reported grades 


Grade discr. 

(entry - 
mid-spring) 


Goal in 
attendiitg this 
college (spring) 


1 like attending 
this college 
(SA-^D)* 


I 


-\Jb 


.17 


Agree 


Agree 


0 


A 


B 


1 


Transfer 


Strongly 
disagrt^.^ 


2 


-1.4 


.20 


Agree 


Agree 


0 


C 


C 


0 


No plans 


Disagree 


3 


■13 


.21 


Agree 


Neutral 


1 


c 


D 


I 


Self- 
iinpTovemenr 


Disagree 


4 


-12 


23 


Neud-al 


Neutral 


0 


B 


C 


I 


No plans 


Neutral 


5 


-1.2 


.23 


Agree 


Sn-ongJy 
agree 


-i 


B 


C 


1 


Transfer 


CKsagreo 


6 


-1.1 


25 


Disagree 


A^^B 


-2 


B 


B 


0 


Few couj&es 


Neutral 


7 


-1.0 


.27 


Agree 


Strongly 
agree 


-I 


B 


B 


0 


No plans 


Neutral 


8 


-0.9 


.29 


Agree 


Agree 


0 


B 


B 


0 


Transfer 


Strongly 


9 


■0.5 


.38 


Agree 


Agree 


0 


C 


C 


0 


No plans 


Neutral 


10 


-04 


.40 


Agree 


Agree 


0 


A 


B 


1 


Bachelor's 


Ellsagrcc 


11 


■0.2 


.45 


NeulraJ 


Agree 


■1 


c 


D 


1 


Transfer 


Neutral 


12 


-0.1 


.4a 


Agree 


Neutral 


1 


B 


C 


1 


Transfer 


Djsagret^ 


13 


00 


.50 


Neutral 


Neutral 


0 


B 


B 


0 


Transfer 


Disagree 


14 


0.3 


.57 


Agree 


Agree 


0 


C 


C 


0 


Bachelor's 
degree 


Disagree 




03 


.57 


Agree 


Neutral 


1 


B 


c 


1 


No plans 


Agree 


16 


0.4 


.60 


Agree 


Agree 


0 


B 


A 




Self^ 
improivemcnt 


Neutral 


17 


0.5 


.62 


Strongly 
agree 


Agree 


1 


C 


B 


•1 


Bachelor's 
degree 


Strungh' 
disagTL-e 



Stale range!, from stnmgly afiroc ISA) Ui •■trongly disagree (SD) 
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